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The aim of this study was to appraise whether 
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mid 1970s varied in their correlations with academic performance 
criteria in the same cohort of examinees. A 1975 form and a 1985 form 
were administered to equivalent samples of high school juniors, and 
self-reported grade-point average and high school rank were obtained. 
The SAT Verbal and Mathematical scores generally had similar 
correlations with the grade criteria in the two samples, but the 
Verbal score had a significantly higher correlation with school rank 
in the 1985 samplo. The principal conclusion is that the 1975 form of 
the SAT does not have greater validity than the 1985 form in 
assessing academic performance, at least at the high school level. 
This outcome offers no support for the hypothesis that the decline in 
the SAT's ability to predict college grades since the mid 1970s, 
observed in recent research, is attributable to changes in the test. 
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Abstract 

The aim of this study was to appraise whether different forms of the SAT 
used since the mid 1970s varied In their correlations with academic 
performance criteria In the same cohort of examinees. A 1975 form and a 1985 
form were administered to equlvale.it samples of high school juniors, and self- 
reported grade -point average and high school rank were obtained. The SAT 
Verbal and Mathematical scores generally had similar correlations with the 
grade criteria in the two samples, but the Verbal score had a significantly 
higher correlation with school rank in the 1985 sample. The principal 
conclusion is that the 1975 form of the SAT does not have greater validity 
than the 1985 form in assessing academic performance, at least at the high 
school level. This outcome offers no support for the hypothesis that the 
decline in che SAT's ability to predict college grades since the mid 1970s, 
observed in recent research, is attributable to changes in the test. 



SAT Scores and Academic Performance In High School 
Recent research by Ramlst (1984) and Morgan (1989) has docvunented 
systematic trends In the SAT's validity In predicting freshman grade-point 
average (GPA) In college. This work, using data from the Validity Study 
Service (VSS) on the correlations between the SAT and GPA within colleges 
participating In that service, found that the validity Increased In the early 
1970s and decreased sine 3 then. 

The Interpretation of those trends Is complicated by a variety of 
changes that may have occurred during that same period of time. Obvious 
possibilities Include changes In the samples of schools participating in 
the VSS, their student bodies, the SAT and its use in college admission, 
and grading practices (Morgan, 1989). The Morgan findings indicate that 
statistical analyses which take into account changes in the samples of schools 
and their student bodies reduce, but do not eliminate the decline in validity 
from the mid 1970s. 

The present study addresses the possibility that changes in the SAT, 
whether intended or not, may be Implicated in the decline from the mid 1970s. 
Some explicit changes have taken place in the test content and the test 
construction procedures, but it is uncertain that they are sufficient to 
affect predictive validity (see the review by Marco, Crone, Braswell, Curley, 
& Wright, in press). All the changes appear to be relatively minor, as can be 
judged from the following summary of the most notable ones. Starting in 1977, 
one reading passage for the reading comprehension items has had content of 
special relevance to members of minority groups. In 1978, the five long 
reading passages for the reading comprehension items were replaced by three 
long and three medium- length passages, the additional passage having science 
content. In 1982, tht! number of medium- difficulty items in the Verbal section 
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were increased, and the number of very difficult and very easy items were 
reduced. Finally, sensitivity reviews of the items to eliminate content that 
might be offensive to women and members of minority groups began on a routine 
basis in 1980, 

The specific purpose of this study was to assess whether different forms 
of the SAT used since the mid 1970s varied in their correlations with academic 
performance criteria in the same cohort of examinees. Using the seune cohort 
eliminates the effects of variations in schools, student bodies, and grading 
practices that complicate interpretation of both the Ramist and the Morgan 
studies of different cohorts. The criteria in question concern high school 
performance. The SAT is designed to predict college performance, but the 
SAT's relationships with performance in other academic settings are also 
relevant from the standpoint of the test's construct validity. Some evidence 
already exists about the SAT's associations with high school performance. 
Most pertinent is the research by N, W, Burton (personal communication, 
September 15, 1989), which found substantial correlations (rs in the ,40s and 
.50s) between the SAT and self -reported GPA and high school rank for 1975, 
1980, and 1985 College Bound Seniors as well as a progressive decline in the 
correlations with GPA but not high school rank. Of course, the use of 
different cohorts complicates interpretation of the trend data. 

Method 

Data were drawn from a larger study, in which an SAT form initially used 
in 1975 (XSA2) and an SAT form Just employed in 1985 (3HSA02) were 
administered in Spring 1989 to Juniors in 862 high schools. The two forms 
constituted two of thirty spirals of actual and experimental SAT sections 
given with standard SAT test administration procedures. The schools and the 
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students volunteered to take part in the study (the latter were promised 
feedback about their test results). Self-reports of GPA and school rank, 
as well as other student characteristics, were also collected during the 
testing.' Data on characteristics of the schools were obtained from the AI 
Profile Survey conducted in 1988. 

Usable data were obtained for 1,554 students in the 1975 sample and 
1,753 in the 1985 sample, after excluding 427 students and 460 students from 
the respective samples. A total of 147 students in the 1975 sample and 151 
in the 1985 sample were excluded because their schools departed from standard 
test administration procedures or participated in a pretest for the study, 
possibly exposing the students to the same tests; the students were not 
juniors planning to attend college within two years; or the students attempted 
three or fewer items in a test section. An additional 280 students in the 
1975 sample and 309 in the 1985 sample were excluded because they did not 
report, in response to a question about their level of effort on the test, 
that they tried to do their best on "all or most of the questions" or "those 
questions that seemed easy or moderately difficult" (i.e., they chose a 
response indicative of a lower level of effort or did not respond at all) 
Outlier analyses were also carried out with the remaining students to identify 
students with discrepant formula scores on the corresponding Verbal and 
Mathematical sections. No students with outlying scores were identified, 
using a test of discordancy for multivariate normal samples (Barnett 6e Lewis, 
1985), with a famllywise significance level of .25. 

The characteristics of the samples are summarized In Table 1. The two 
samples were highly similar, both with regard to the characteristics of the 
students themselves and their schools. 

o 8 
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Insert Table 1 about here 



For each SAT form, converted Verbal and Mathematical scores, as well as 
formula scores for each item type (Antonyms, Analogies, Sentence Completion, 
Reading Compruaension, Regular Mathematics, Quantitative Comparisons) were 
obtained. 

GPA was scaled from 4.3 (A+) to 0 (E/F). School rank was scaled from 4 
(top 1% to 3%) to 1 (bottom third), each score representing a standard 
deviation interval (+1.5 to +2.5 to -.5 to -1.5) for the normal deviates 
corresponding to the percentile ranks. ^ School rank was excluded for students 
from schools with fewer than 25 students in the junior class because of the 
uncertain applicability of this variable in small classes. GPA and school 
rank were also combined in a grade composite, standardizing the GPA and school 
rank scores and computing their mean. (When one of the scores was missing, 
the grade composite was based on the available score.) 

For each sample, product -moment correlations of the Verbal and 
Mathematical scores with the grade criteria were computed, the Ns varying 
because of missing data. The correlations of the item type scores with the 
criteria were also computed for exploratory purposes. 

Results and Discussion 
The means and standard deviations of the SAT scores and the grade 
criteria are reported in Table 2 for each sample. The data for the two 
samples were generally similar, the only marked differences occurring for 
some item type scores. The 1985 sample had somewhat lower Antonyms and 
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Quantltatlve Comparisons scores and somewhat higher Analogies and Reading 
Passages scores. 



It Is noteworthy that the SAT data resemble those obtained when the test 
forms were originally administered, taking Into consideration that the 
examinees In the original administrations Included seniors as well as juniors 
(Stern & Bullock, 1976; N. Wright, D. Wright, & Weber, 1986). The SAT-V and 
SAT-M means for the 1975 sample In this study were 410 and 455, while the 
original means were 434 and 473; the means for the 1985 sample in this study 
were 405 and 453, and the original means were 423 and 470. This similarity 
suggests that the students in the study were adequately motivated, even though 
they were taking the test for research, and that the present results are 
generallzable to the regular SAT test- taking population. 

GFA, school rank, and the grade composite had generally similar 
correlations in the two samples. CPA and school rank correlated .71 in the 
1975 sample and .66 in the 1985 sample. Moreover, CPA correlated .95 with 
the grade composite in botn samples, and school rank correlated .92 and .93 
with the grade composite in the two samples. 

The correlations of the SAT scores with the grade criteria are reported 
in Table 3 for each sample. The SAT scores had generally similar correlations 
with CPA in the two samples (the sole exception was the Analogies item type 
score, which was significantly, e < -05, higher in the 1985 sample).* But 
SAT-V and two item type scores (Antonyms and Reading Comprehension) had 
significantly (2 < .05) higher correlations with school rank in the 1985 
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sample. The SAT-V correlations with school rank were .39 in the 1975 sample 
and .46 in the 1985 sample. Two item type scores (Analogies and Quantitative 
Comparisons) also had significantly (ji < .05) higher correlations with the 
grade composite in the 1985 sample. 



The principal conclusion of this study is that the 1975 form of the SAT 
does not have greater validity than the 1985 form in assessing academic 
performance, at least at the high school level. It is apparent that the 
outcome of this investigation offers no support for the hypothesis that the 
observed decline in the SAT's ability to predict college grades since the mid 
1970s is attributable to changes in the test. 

The differences in the pattern of correlations of the SAT Verbal score 
with CPA and school rank—similar correlations with CPA in the two samples but 
higher correlations with school rank in the 1985 sample- -cannot be readily 
explained. The variances for school rank were similar in the two samples, and 
the samples do not appear to differ on other potentially relevant variables. 
It is noteworthy that the depressed correlation of the Verbal score with 
school rank in the 1975 sample (relative to the corresponding correlation with 
GPA in that sample), which contributed to the 1975 vs. 1985 difference, is 
mirrored in the Burton study. The Verbal score's correlations with GPA, for 
1975 College Bound Seniors, were .50 for males and .51 for females; the 
corresponding correlations with school rank were .45 and .48. 

It is important to emphasize that the resi;<Ics of this study concern the 
comparative validity of the two SAT forms with respect to academic performance 
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in today's high schools. The findings do not necessarily reflect what the 
validity of these forms would have been at an earlier point. Depending on 
the grading practices in previous years and the match between the test content 
and the then current curriculum, the validity of the forms could by very 
different. The Burton research is pertinent in this regard. The SAT's 
correlations with GPA and school rank for the 1975 and 1985 College Bound 
Seniors were generally similar to those in this study, with the important 
exception of the Verbal score's correlations with both grade criteria in the 
1975 cohort. These correlations were substantially higher: for GPA, .50 
(men) to .51 (women) vs. .43 in the present study; for school rank, .45 (men) 
to .48 (women) vs. .39 in this investigation. 

Similarly, the present findings about the comparative validity of the 
SAT forms today do not bear on trends over time in the overlap of the SAT with 
high school GPA and school rank. Even if the 1985 form were unequivocally 
shown to have greater validity than the 1975 form against criteria of high 
school performance in 1989, it does not follow that the 1985 form is also more 
redundant with contemporary high school record variables or has less 
incremental validity vis-a-vis them. 

Finally, the reliance on self -reported grades in this study inevitably 
raises a question about their usefulness for this purpose. It is well- 
established that self-reported high school grades are relatively accurate and 
correlate appreciably with recorded college grades (see the review by Baird, 
1976), though not as highly as actual school grades (e.g., Morgan, 1989). The 
accuracy of the grades in this study is also supported by their substantial 
correlations with the SAT scores. 
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Footnotes 

^The GPA and school rank questions follow: 

Give or estimate your grade point average. 

A+ 

A 

A- 

B+ 

B 

B- 

C+ . 

C 

C- 

D+ 

D 

D- 

E/F 

In class rank, are you In the top, middle, or bottom third of your 

class? 

Top third 
Middle third 
Bottom third 
I don't know 
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If you named the top third to the previous question, about where do you 
stand in the top third? 

1-3% 

4-10% 
11-20% 
21-33% 

*rhe question follows: 

Which of the following best describes your efforts on this test? (Mark 
only one . ) 

I tried to do my best on all or most of the questions on this test. 
I tried to do my best on those questions that seemed easy or moderately 
difficult. 

I tried to do my best on those questions that were in the beginning of 
each section. 

I tried to do my best on the first sections of the test. 
I did not try to do my best on this test. 

*rhe school rank categories were quantified as follows: 4 (top "1 to 
3%"; 2.05 Z), 3 (top "4 to 10%," "11 to 20%," "21 to 33%"; 1.48 Z to .61 Z) , 
2 ("middle third"; .00 Z) , and 1 ("bottom third"; -.97 Z) . 

*These two- tail significance tests are approximations because of the 
clustering within and between samples, with more than one student from each 
school in both samples. 
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